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Abstract — Frame Synchronization (FS) is required in several commu- 
nication standards in order to recover the individual frames that have 
been aggregated in a burst. This paper proposes a low-delay and reduced- 
complexity Sliding Trellis (ST)-based FS technique, compared to our 
previously proposed trellis-bused FS method. Each burst is divided into 
overlapping windows in which FS is performed. Useful information is 
propagated from one window to the next. The proposed method makes 
use of soft information provided by the channel, but also of all sources 
of redundancy present in the protocol stack. An illustration of our ST- 
based approach for the WiMAX Media Access Control (MAC) layer is 
provided. When FS is performed on bursts transmitted over Rayleigh 
fading channel, the ST-based approach reduces the FS latency and 
complexity at the cost of a very small performance degradation compared 
to our full complexity trellis-based FS and outperforms state-of-the-art 
FS techniques. 

Index Terms — CRC, Cross-layer decoding, Frame synchronization, 
Joint decoding, Segmentution. 

I. Introduction 

FS is an important problem arising at various layers of the protocol 
stack of several communication systems. The most obvious being, at 
Physical (PHY) layer, to recover the payload and the side information 
(headers, etc..) of PHY packets or frames. First results UJ, (2) 
considered data streams in which regularly spaced fixed patterns 
or Synchronization Words (SW) are inserted to delimit fixed-length 
frames, as is the case, e.g., at the PHY layer of Digital Video 
Broadcasting - Handheld (DVB-H) for MPEG2 transport stream 
frames. Synchronization tools based on the maximization of the 
correlation between the SW and the received data have been proposed 
in UJ. This has been improved in (2), where the optimal statistic for 
FS has been proposed for the AWGN case, taking into account the 
presence of data around the SW. This was further extended in (3) for 
more sophisticated transmission schemes and in |4), to provide 
robustness against frequency and phase errors. 

In many communication systems, frames are of variable-length, 
see, e.g., the 802.11/802.16 standards (6), (7) for Wireless Local Area 
Networks (WLANs). In absence of SW, the Header Error Control 
(HEC) field of the header has been employed in (8) to perform FS 
with an automaton adapted from J5J- A length field, assumed present 
in the frame header, facilitates the FS when the noise is moderate. In 
1101 . 1111 . several hypothesis testing techniques have been proposed 
to perform FS in presence of SW, which are further extended in 
1121 to exploit a priori information on the prevalence of ones and 
zeros in the payload at the price of a small additional signaling 
and computational complexity. However, insertion of SW requires 
some modification to the transmitter, which might not be standard 
compliant. Most of these FS techniques work on-the-fly, i.e., at each 
time, only few data samples are processed to perform FS and almost 
no latency is introduced. 

Initially, FS has mainly been considered at PHY layer, albeit this 
problem may also occur at upper layers of the protocol stack. In fact, 
some frame aggregation techniques at intermediate protocol layers 
have been proposed recently in order to reduce the signalization 
overhead, see, e.g., 11131 in the context of 802.11 standard. Efficient 
FS in this context is thus very important, since if some frames are not 



correctly delineated, a large amount of bits has to be retransmitted. In 
this context, processing a whole burst at each step may significantly 
improve the FS performance. This was evidenced in 1141 for the 
segmentation of MAC frames aggregated in WiMAX PHY bursts, 
where Joint Protocol-Channel Decoding (J PCD) is performed using 
a modified BCJR algorithm 1151 to obtain the frame boundaries. It 
exploits all available information: soft information at the output of the 
channel (or channel decoder) as well as the structure of the protocol 
layers (SW, known fields in headers, presence of Cyclic Redundancy 
Check (CRC) or checksums, etc). 

This paper proposes an adaptation of the reduced-complexity 
Sliding Window (SW) [16] variant of the BCJR algorithm, presented 
for the decoding of convolutional codes, to develop a low-delay and 
reduced-complexity version of the FS technique presented in 1141 . 

The FS problem is first stated as a maximum a posteriori (MAP) 
estimation problem in Section [III Then, Section [TIT] reformulates 
the trellis-based technique for FS introduced in 1141 . The proposed 
Sr-based algorithm is presented in Section |IV| and is illustrated in 
Section|V]with the FS of WiMAX MAC frames a ggregated in bursts. 

II. MAP ESTIMATION FOR FRAME SYNCHRONIZATION 

A. Frame structure 

Consider the n-th variable-length frame at a given protocol layer. 
This frame is assumed to contain A n = lh + £ p>n bits, where the 
leading £h bits represent the frame header, of fixed length, and the 
remaining £ Pt „ bits constitute the variable-length payload. In the 
header, £ c bits are some HEC bits: CRC or checksum. The length A n 
is assumed to be a realization of a stationary memoryless process A 
characterized by 

7ta = Pr (A = A) / for £ 

mill max ■ (l) 

where £ m i n and ^ max are the minimum and maximum length in bits 
of a frame. 

The header h n of the n-th frame can be partitioned into four fields. 
The constant field k, contains all bits which do not change from 
one frame to the next. It includes the SW indicating the beginning 
of the frame, and other bits which remain constant 1 1 71 once the 
communication is established. The header is assumed to contain a 
length field u n , indicating the size of the frame in bits X n , including 
the header. Our task is to estimate the successive values taken by 
this quantity in all frames of the burst. The other field o n , gathers 
all bits of the header which are not used to perform FS. Finally, the 
HEC field c„ is assumed to cover the £h — £ c "working" bits of the 
header, i.e., c n = f (k, u n , o), where f is some (CRC or checksum) 
encoding function. The payload (assumed not protected by the HEC 
field) of the n-th frame is denoted by p n . It is modeled here as a 
binary symmetric sequence. 

In what follows, the length of a vector z (in bits) is denoted as £ (z) 
and its observation (soft information) provided either by a channel, 
a channel decoder, or a lower protocol layer is denoted as y z . z a 
represents the sub- vector of z between indexes a and b (in bits). 




Fig. 1. L = 16 bits with 
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B. Aggregated frames within a burst 

Consider a burst of L bits consisting of N aggregated frames. This 
burst contains either A — 1 data frames and an additional padding 
frame containing only padding bits, or A data frames. Assume that 
each of these frames, except the padding frame, contains a header and 
a payload and follows the same syntax, as described in Section Hi- Al 

Assuming that L is fixed before frame aggregation and that A is 
not determined a priori, the accumulated length in bits £ of the n 
first aggregated frames can be described by a Markov process, which 
state is denoted by S n . With this representation, the successive values 
taken by S„, for n — 0, 1, . . . can be described by a trellis 1141 
such as that of Figure [T] with a priori state transition probabilities 
p(S„ = £\ Sn-i = (') deduced from Q}. If I < L, then 



p(s n = e\s n - 1 =£') = 

and if £ = L, then 

P (S n = L j Sn-i = £') 



n t - t , if Imin < £ ~ (! S: In 

else, 



r 0,if L-£' >£ max 
1, if < L - £' < i min 



= < 



(2) 



(3) 



2 7r fc , else. 

I k-L-l' 



In the trellis, dashed transitions correspond to padding frames and 
plain transitions correspond to data frames. 

C. Estimators for the number of frames and their boundaries 
Consider a burst of N aggregated frames and some vector yf 

containing soft information about the bits of xf . Here, one assumes 

that the first entry of yf corresponds to the first bit of xf\ This 

assumption is further discussed in Section IlII-CI 

The suboptimal MAP estimator presented in 1141 consists in first 

estimating A and then in estimating the locations of the beginning 

and of the end of the frames. The MAP estimate Amap of A is given 

by 



Amap = arg 



max 

„<n<N n 



P(S n 



L\yi 



(4) 



with N min = rL/^maxl^A max = \L/£ mia ], and [■] denoting the 
upward rounding. Once Amap is obtained, the MAP estimate for the 
index £ n of the last bit of the n-th (n = 1, Amap) frame is 

(5) 



£„ = arg max P [S n = £\y 1 
and the length X n of the n-th frame is estimated as 

A„ = arg max P (^S n = ^yi') — arg max P (^S n -i 



£\yi 



III. Trellis-based FS Algorithm 

In {5}, and ©, one has to evaluate P(S„ = £\yi) for all 
possible values of n and £. This can be performed efficiently using 
the BCJR algorithm 1151 . by evaluating first 



P(S n = £,y?) = a n (£)p n (£) 



where 



and 



a n (£) = P(S n = £,y[) = Y,a n - 1 (£') 7n (£',£), 
i' 

p n {£) = P(yf +1 \S n = I) =Y / Pn+x{£')iu+i{£,£'), 

I' 

ln {£',£) =P(Sn=t,yi>+i\S«-i=t). 



(7) 

(8) 
(9) 

(10) 



Classical BCJR forward and backward recursions allow to evaluate 
a and f3. The initial value 5o of the forward recursion is known, 
leading to oto(£ = 0) = 1 and ao(£ 7^ 0) = 0. For the backward 
recursion, assuming that all allowed final states are equally likely 
(which is a quite coarse approximation), one gets 

fin(L) = — — — , A min < n < A max . (11) 

J* max -/V m in 1 I 

All other values of Pn(£), for £ < L are initialized to 0. 

A. Evaluation of y n 

Two cases have to be considered for evaluating 7„ (£',£)■ 
When £ < L, the transition corresponding to the n-th frame cannot 
be the last one, thus corresponds to a data frame. Assuming that 
-£min ^ I— i ^ ^max, the bits between £' + 1 and I may be interpreted 
as x^ +1 = [k, u„, o, c, p], where u n = u (£ — £') is the binary 
representation of I— £'. The corresponding observation can be written 
as y{ l+1 = [ytjajojcjp]. With these notations, for £ ^ L, 
7n (£',£) = ji(£',£),wth 

yi(£',£) = p(S n =e\S n - 1 =£')v d (y^+i,^ +1 ),(12) 



where 

d 

<P 



(A'+iM'+i) = P (Yfclk) P (y«|u {I - £')) 

J2 p (y°l°) p (yc|c = f (k, u (£ - £') , o)) p (o 

o 

^P(y p |p)P(p). 



(13) 



Under the assumptions described above and for a memoryless AWGN 
channel with variance a 2 , we get 



P (y„|u {£-£')) =P(y„K)= \[ "4 



-(y»(i)""»(i))72» 



2-Ko- 



(6) 



Assuming that the values taken by p are all equally likely, one gets 
P(p) = 2~ f(p) . 

In ()13|) . the sum over all possible o can be evaluated with a 
complexity 0(£ (o) 2 1 ^), as proposed in 1171 . see also 1141 for more 
details. 

When £ = L and L — £' < ^ ma x, the n-th frame is the last one, 
and x^ /+1 = 1 has also to be considered in j n (£' , L), leading to 

7n (£', L) = p ( s " = L > yi+^ p n = P\Sn-i = (!) 

P=0,1 

= 1 i{( l ,L)P{P n = G\S n ^ 1 =£') 

+ 1 l(£',L)P(P n = l\S n -i=£'), (14) 



where P n is a random variable indicating whether the n-th frame is 
a padding frame and its a priori probability is given by 



P(Pn = l\Sn-l=t) 



In CD, 



= < 



r a if l - e > ^ max 

1, if < L - I' < imin 



w (15) 
tt a , else. 

A=L-f' + l 



7^V) = p(s„=L, y f, +1 \s n -i=e,p„ = i) 

= P(yf/+i|Pn = l,S'»-i=i',S„=i) (16) 
accounts for the padding frame, while 
jt(£',L) = p(Sn=Ly t , +1 \S n -i=t,Pn = 

= P (Sn = L\Sn-l = f, P n = 0) <p A (yi'+iM'+i 
accounts for the data frame, with 

P (Sn = L\S n -l = £', Pn=0)= l^~ V ■ 

A=£ min 

S. Complexity evaluation 

The complexity of the FS algorithm described in Section [TT] is 
proportional to the number of nodes or to the number of transitions 
within the trellis on which FS is performed. From Figure [TJ one sees 
that the trellis is lower-bounded by the line £ = n£ m in and upper- 
bounded by the lines i — n^ max and £ = L. This region may be 
divided into two triangular sub-regions, one with < n < iV m i n — 1, 
bounded between £ = nimin and £ = n^ max , and the other with 
^Vmin < n < iVmax — 1, bounded between £ = n£ m in and £ = L. 
Thus, summing up the number of nodes in each sub-region, one gets 
the number of nodes in the trellis 

^min-l JVmax 
A/" ri — ^ Tl (fniax ^min 

Taking 7V min ss L/^ max and iV max ~ L/£ m i n , ([17) simplifies to 

. f . 



A/ - ^ max 

2 I ^ 



max -t- mi ii 



O (L 



(18) 



From each node, at most ^ max — ^ m in transitions may emerge. Thus, 
from {T8j, the number of transitions A/t may also approximated as 

Mt = (L 2 ) . 

C. Limitations 

The hold-and-sync technique presented in 11141 for performing FS 
is based on the knowledge of the beginning and length of the burst. 
This requires an error-free decoding of the headers of lower protocol 
layers, which contain this information. This may be done using 
methods presented in (171 , which enable the lower layer to forward 
the burst to the layer where it is processed. The main drawback of this 
FS technique in terms of implementation is the increase in memory 
requirements for storing soft information. This is estimated in 1181 , 
1191 to be three to four times more. However, the plain trellis-based 
FS algorithm, as described above, requires buffering the whole burst 
which induces some buffering and processing delays proportional to 
L 2 , see j I St . To alleviate these problems, a new low-delay and less- 
complex variant of the previously presented FS technique is now 
proposed. 




Fig. 2. ST for the m-th decoding window, the original trellis is in gray 



IV. Sliding Trellis-based FS 

In classical SW-BCJR methods, decoding is done within a window, 
which at each step is shifted bit-by-bit 1161 or by several bits 1201 . 
From one window to the next, the results obtained during the forward 
iteration are reused, contrary to those of the backward iteration. The 
number of bits the window is shifted at each iteration determines the 
trade-off between complexity and efficiency. 

Contrary to the trellis for a convolutional code, the trellis consid- 
ered in Figure Q] has a variable number of states for each value of 
the frame index n. One may apply directly the SW ideas, but due to 
the increase of the size of the trellis (at least for small values of n), 
this would still need very large trellises to be manipulated, with an 
increased computation time. Here, a ST-based approach is introduced: 
a reduced-size trellis is considered in each decoding window. As 
in 1201 , some overlapping between windows is considered, in order 
to allow better reuse of already computed quantities and to allow 
complexity-efficiency trade-offs. 

A. Sliding Trellis 

In the proposed ST-based approach, a burst of L bits is divided into 
M overlapping windows with sizes L m , m = 1, M. For each of 
these windows, the bits from e m _i + 1 to e m ~i +i m are considered, 
where e m _i is the bit index of the last bit of the last frame deemed 
reliably synchronized in the ra — 1-th window. 

A ST moves from window to window to perform decoding. One 
such ST is illustrated in Figure [2] Let n and £ be the local trellis 
coordinates. Once P{S n a = ^ly^^+i™) is evaluated, one can 
apply the estimators 41}, and to determine the number of 
frames N m in the m-th window (including the last truncated frame), 
the beginning, and the length of each frame. For the m-th window 
(m < M), among the N m decoded frames, only the first N„\ 
frames are considered as reliably synchronized, since enough data 
and redundancy properties have been taken into account. Truncated 
frames, especially when the HEC has been truncated, and the frame 
immediately preceding such frames, are not considered reliable. Thus, 
only the complete frames ending in the first L m — ^ max — £h 
bits of the window are considered as reliable. The unreliable region 
towards the boundary of the window is dashed in Figure [2] 

The initialization of (3 is performed as in Section [Till since 
no knowledge from the previous window can be exploited. The 
initialization of a and the evaluation of 7 towards the window 
boundary may depend on the location of the window inside a burst. 
Three types of window locations are considered: the first window at 



the start of a burst, the intermediate windows in the middle of the 
burst, and the last window at the end of the burst. 

The first window (m = 1) contains L m < L bits and starts 
at eo = 0. The decoding approach, including the initialization of 
a for this first window is similar to that presented for the trellis- 
based approach. An exception is the computation of 7™ (£',£), where 
two cases have again to be considered. The first corresponds to 
normal data frames, leading to ~/„ (£' , L m ). The second accounts for 
truncated data frames towards the boundary of the window, leading 
to 7^ (£' , L m ), which is detailed in Section |IV-B| 

The intermediate windows (1 < m < M) contain the bits from 
Em-i + 1 to e m _i + Lm < L, see Figure|2] The bit index e m _i, of 
the last bit of the last frame deemed reliably synchronized (i.e., the 
N m _ 1 -th frame) in the m — 1-th window, corresponds in the local 
coordinates of the m — 1-th ST to £ — e m _i — £ m -2- The m-th 
ST starts at the local coordinates (n = N m _ 1 ,£ = e m -i — Sm-2) 
of m — 1-th ST. The computation of 7^ (£',£) for an intermediate 
window is identical to that of the first window. For the initialization 
of cC (I) , following the idea of the SW-BCJR decoder (16), (21], up 
to £ m ax initial values for a™ (I) are propagated from the m — 1-th 
window to the m-th window, see Section IIV-CI This allows a better 
FS in case of erroneous FS in the m — 1-th window. 

The last window (m = M) has no incomplete frame at its 
end. Only the presence of a padding frame has to be taken into 
consideration. The decoding is performed as in the trellis-based 
approach (SectionlriH. except for the initialization of cxq (J), which 
is similar to that of the intermediate window case. 

Note that the m-th and m + 1-th windows overlap over L° m bits, 
with £ h + £ max < L° m < £ h + 2£ max ■ 

B. Evaluation of "f n 

When m < M, transitions corresponding to truncated frames have 
to be considered at the end of the window. When the size of the 
truncated frame is larger than £h, the header is entirely contained in 
the truncated frame. In this case 7^ {£' , L m ) — 7^ (£' , Lm) , with 

ll, {£' ,L m ) =p(S™ = L m |S™_i = £') y (yp +i) ■ (19) 

In H9\ , since truncated frames have to be considered, 
p (S™ = L m |5^_! = £') is given by (3}. Moreover, the length of 
the frame, i.e., the content of the length field u n , is now only known 
to be between max(L m — £',£min) and ^ max bits. Thus 

J (y^.x^J = P (y fc |k) Y, p (yp|p) p (p) 

p 

£ P(u(£))J2(P(yu\u(£))P(y \o) 



<=max(i m -I 1 ,£ min ) 



P(y c |c = f (k,u(*),o))P(o)). 



(20) 



When the size of the truncated frame is strictly less than £h, for the 
sake of simplicity, we assume that all bits of the truncated header are 
equally likely. In such case 7™ (£' , L m ) = 7^ (£', L m ) , with 

jl(F,L m ) = p (5« = L m \S%Li = £') 

E p (y^J4+i) p - (2D 

where p = -MS^-i = ?) is still given by $3$ and P(x|'™ 1 ) = 

2 , since all beginning of headers are assumed equally likely. 

For m = M, the evaluation of j^f is as in Section lTlI-AI 



L (bytes) 


1800 


8000 


16000 


24000 


Trellis-based (# of Nodes) 


24300 


480000 


1920000 


4320000 


ST-based (# of Nodes) 


17400 


77200 


154450 


231700 


Complexity Gain 


1.4 


6.2 


12.5 


18.6 



TABLE I 

FS Complexity Comparison for L n 



480 + L° 



C. Initialization of a in the sliding trellises 

In the SW-BCJR algorithm proposed in [ 16|, the a m s evaluated in 
the m-th window are deduced from those evaluated in the m — 1-th 
window. Here, since the number of states S n evolves with n, a™ 
cannot be obtained that easily from qJ™ -1 . 

In the m — 1-th window, one has evaluated a™^ 1 (£), with ^ 
£ ^ Lm-i and n ^ [I/m-i/^min] • We choose to propagate at 
most / max values of a from n = A r „_ 1 in the rn — 1-th window to 
n = in the m-th window (for £ = 0, / max — 1) as follows 



a {£) = not- 



(£«l-l — £m-2 + • 



(22) 



where k is some normalization factor chosen such that the a™(£)s 
sum to one. This allows the first frame of the m-th window to start 
at any bit index between e m _i + 1 and e m _i + / max . 

D. Complexity 

Consider M windows of approximately the same size L w , which 
are overlapping on average on L° = £h + 1.5/max bits. For 
sufficiently large L, one has to process M w L ^ L _ L o overlapping 
windows, each one with 

AC ~ (LW)2 



(23) 



— £ ■ 

ax 1 nun 
2 \ ^max^min 

nodes. The total number of nodes to process is then 



MAC 



L 



{V 



rw_To O \ f . 

±J ±J \ t-max^-min 

This decoding complexity is smaller than that of Section UlI-BI A 
comparison for different values of burst size L is provided in Table U 
for window size L w = 480+L°. One can observe that the complexity 
gain increases with the size of the burst. 

Choosing small values for L w reduces the latency as well as 
the complexity at the cost of some sub-optimality in the decoding 
performance. Note that L w cannot be chosen too small (smaller than 
£h + 2£max) to ensure at least one reliable FS in each window. 

V. Simulation results 

In the WiMAX standard (7), the downlink (DL) sub-frames are 
divided into bursts. Each burst can contain multiple concatenated 
fixed-size or variable-size MAC frames: it is filled with several MAC 
frames, until there is not enough space left. Padding bytes (OxFF) are 
then added Q at the end of the burst. Each MAC frame begins with a 
fixed-length header, followed by a variable-length payload and ends 
with an optional CRC. As we are considering only the DL case, 
where the connection is already established, MAC frames belonging 
to a burst contain only Convergence Sublayer (CS) data, so only the 
Generic MAC header Q is possible inside a burst. 

Some assumptions are made in what follows for the sake of 
simplicity. CRC, ARQ, packing, fragmentation, and encryption are 
not used for the MAC frames inside the burst. Some fields are already 
fixed in a MAC header, and with the considered situation, fields 
such as Header Type (HT), Encryption Control (EC), sub-headers 
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Fig. 3. FS methods for transmission over Rayleigh channel 



and special payload types (Type), Reserved (Rsv), CRC Indicator 
(CI), and Encryption Key Sequence (EKS) remain constant. The 
LEN field, representing the length in bytes of the MAC frame, and 
the Connection IDentifier (CID) have variable contents. The Header 
Check Sequence (HCS), an 8-bit CRC, is used to detect errors in the 
header and is a function of the content of all header fields. 

The considered simulator consists of a burst generator, a BPSK 
modulator, a channel, and a receiver. Simulations are carried over 
Rayleigh fading channels, where the modulated signal is subject to 
zero mean and unit variance fast (bit) Rayleigh fading plus zero-mean 
AWGN noise. For performance analysis, Erroneous Frame Location 
Rate (EFLR) is evaluated as a function of the channel Signal-to-Noise 
Ratio (SNR). It should be noted that in order to recover a frame 
correctly both ends of the frame have to be correctly determined. 

In our simulations, L = 1800 bytes. Since WiMAX MAC frames 
are byte-aligned, i.e., the LEN field of the frame is in bytes and all 
MAC frames contain an integer number of bytes, a, ft, and 7 are 
evaluated for Is corresponding to the beginning of bytes. Data frames 
are randomly generated with a length uniformly distributed between 
^min = 50 bytes and i max = 200 bytes. If there is not enough space 
remaining in the burst, a padding frame is inserted to fill the burst. 
The burst is then BPSK modulated and sent over the channel. 

Simulation results for the trellis-based FS technique (Section Hit, 
the ST-based approach (Section [IV}, a FS based on hard decision on 
the received bits, and a state-of-the-art on-the-fly technique (denoted 
by MU, for modified Veda's method) described in (8) are shown 
in Figure [3] MU technique involves a three-state automaton for FS 
and uses the HCS as an error-correcting code. Here, the method in 
|8| has been modified to cope with short (8-bit) HCS, where several 
candidates for two-bit error syndromes are possible. 

For the ST-based approach, the burst is divided into three windows, 
with Li = 600 bytes, L 2 = 600 + L° bytes, and L 3 = 600 + 
L2 bytes. Compared to the trellis-based FS, the ST-based approach 
shows a slight performance degradation of 0.5 dB, but reduces the 
delay and the computational complexity. On average, the overlap is 
about 277 bytes and a decrease in complexity by a factor of 1.7 
is observed. The computational complexity of the on-the-fly MU 
technique is much smaller than that of the ST-based approach, at 
the cost of a noticeable loss in efficiency. 

VI. Conclusion 

A reduced-complexity, low-delay, and efficient ST-based technique 
for the robust FS of aggregated frames is presented. Compared to 
the trellis-based FS algorithm proposed in 1141 . the FS delay and 



computational complexity are significantly reduced. The price to be 
paid is a slight performance degradation. 

This technique has been applied to the FS of WiMAX MAC bursts. 
A significant gain in performance is obtained for Rayleigh fading 
channels compared to classical FS. Extensions to perform on-the- 
fly FS, by utilizing the features of Ueda's method and at the same 
time exploiting the structural properties of the headers of frames, are 
currently under investigation. 
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